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This paper develops empirical likelihood methodology for irreg¬ 
ularly spaced spatial data in the frequency domain. Unlike the fre¬ 
quency domain empirical likelihood (FDEL) methodology for time 
series (on a regular grid), the formulation of the spatial FDEL needs 
special care due to lack of the usual orthogonality properties of the 
discrete Fourier transform for irregularly spaced data and due to 
presence of nontrivial bias in the periodogram under different spatial 
asymptotic structures. A spatial FDEL is formulated in the paper 
taking into account the effects of these factors. The main results of 
the paper show that Wilks’ phenomenon holds for a scaled version 
of the logarithm of the proposed empirical likelihood ratio statistic 
in the sense that it is asymptotically distribution-free and has a chi- 
squared limit. As a result, the proposed spatial FDEL method can 
be used to build nonparametric, asymptotically correct confidence 
regions and tests for covariance parameters that are defined through 
spectral estimating equations, for irregularly spaced spatial data. In 
comparison to the more common studentization approach, a major 
advantage of our method is that it does not require explicit estimation 
of the standard error of an estimator, which is itself a very difficult 
problem as the asymptotic variances of many common estimators de¬ 
pend on intricate interactions among several population quantities, 
including the spectral density of the spatial process, the spatial sam¬ 
pling density and the spatial asymptotic structure. Results from a 
numerical study are also reported to illustrate the methodology and 
its finite sample properties. 
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1. Introduction. In recent years, there has been a surge in research in¬ 
terest in the analysis of spatial data using the frequency domain approach; 
see, for example. Hall and Path [14], Im, Stein and Zhu [15], Puentes [10, 11], 
Matsuda and Yajima [24] and the references therein. An intent of frequency 
domain analysis is to allow for inference about covariance structures through 
a data transformation and possibly without a full spatial model, though this 
approach has complications. In contrast to the time series case where ob¬ 
servations are usually taken at regular points in time, the data sites are 
typically irregularly spaced for random processes observed over space. The 
lack of a fixed spacing and possible nonuniformity of the (irregularly spaced) 
data-locations destroy the orthogonality properties of the sine- and cosine- 
transforms of the data, making Fourier analysis in such problems a chal¬ 
lenging task. In a recent paper, Bandyopadhyay and Lahiri [1] (hereafter 
referred to as [BL]) carried out a detailed investigation of the properties of a 
suitably defined discrete Fourier transform (DFT) of irregularly spaced spa¬ 
tial data, and provided a characterization of the asymptotic independence 
property of the spatial DFTs. In this paper, we utilize the insights and find¬ 
ings of [BL] to formulate a frequency domain empirical likelihood (FDEL) 
for such spatial data. The FDEL method is shown to admit a version of the 
Wilks’ theorem for test statistics about spatial covariance parameters (e.g., 
having chi-square limits similarly to parametric likelihood), without explicit 
assumptions on the data distribution or the spatial sampling design. 

To highlight potential advantages of the FDEL approach in this con¬ 
text, suppose that {Z{s): s G (d G N = {1,2,...}) is a zero mean second- 
order stationary process that is observed at (irregularly spaced) locations 
si,... ,s„ in a domain C Also, suppose that we are interested in fit¬ 
ting a parametric variogram model {7(-;d) '-0 G 0}, 0 G (p G N) using 
the least squares approach (cf. Cressie [7]). A spatial domain approach is 
based on estimating the parameter 6 using 



where hi,... ,hm are some user specified lags and where 27 „(hj) is a non- 
parametric estimator of the variogram of the Z(-)-process at lag hj. Since 
the data locations si,...,s„ are irregularly spaced, a nonparametric esti¬ 
mator 27 „(-) of the variogram typically requires smoothing which results in 
a slow rate of convergence, particularly in dimensions d>2. Further, the 
asymptotic variance of On in such situations involves the spectral density of 
Z(-)-process and the spatial sampling density of the data-locations si,..., s„ 
(cf. Lahiri and Mukherjee [21]) which must be estimated from the data to 
carry out inference on 6 using the asymptotic distribution. In contrast, the 
FDEL approach completely bypasses the need to estimate 27 (-) directly and 
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it also carries out an automatic adjustment for the complicated asymptotic 
variance term in its inner mechanics, producing a distribution-free limit law 
that can be readily used for constructing valid tests and confidence regions 
for 9. See Example 3 in Section 3 for more details of the FDEL construction 
in this case and Section 6.2 for a data example demonstrating the advantages 
of the proposed spatial EDEL method over the traditional spatial domain 
approach. In general, the proposed EDEL method provides a nonparametric 
“likelihood”-based inference method for covariance parameters of a spatial 
process observed at irregularly spaced spatial data-locations without requir¬ 
ing specihcation of a parametric joint data model. 

Originally proposed by Owen [32, 33] for independent observations, em¬ 
pirical likelihood (EL) allows for nonparametric likelihood-based inference in 
a broad range of applications (Owen [34]), such as construction of confidence 
regions for parameters that may be calibrated through the asymptotic chi- 
squared distribution of the log-likelihood ratio. This is commonly referred 
to as the Wilks ’ phenomenon, in analogy to the asymptotic distributional 
properties of likelihood ratio tests in traditional parametric problems (Wilks 
[39]). In particular, EL does not require any direct estimation of variance or 
skewness (Hall and La Scala [13]). However, a difficulty with extending EL 
methods to dependent data is then to ensure that “correct” variance esti¬ 
mation occurs automatically within the mechanics of EL under dependence. 
For (regularly spaced) time series data, this is often accomplished by using 
a blockwise empirical likelihood (BEL) method (cf. Kitamura [17]), which 
was further extended to the case of spatial data observed on a regular grid 
by Nordman [26, 27] and Nordman and Caragea [28]. 

Monti [25] and Nordman and Lahiri [30] proposed periodogram-based 
EL methods for time series data. Their works show that, in view of the 
asymptotic independence of the DFTs, an analog of the EL formulation for 
independent data satisfies Wilks’ phenomenon in the frequency domain. As 
a result, the vexing issue of block length choice can be completely avoided by 
working with the DFTs of (regularly spaced) time series data. In this paper, 
we extend the frequency domain approach to irregularly spaced spatial data. 
Such an extension presents a number of unique challenges that are inherently 
associated with the spatial framework. First, the irregular spacings of the 
data locations make the usefulness of the DFT itself questionable, as the ba¬ 
sic orthogonality property of the sine- and cosine-transforms of gridded data 
at Fourier frequencies [i.e., at frequencies ujj = lirj jn for j = 0,1,..., (n — 1) 
for a time series sample of size n] no longer holds (cf. [BL]). Second, unlike 
the compact frequency domain [0,27r] for regular time series, in the case 
of irregularly spaced spatial processes sampled in d-dimensional Euclidean 
space one must deal with the unbounded frequency domain Third, as 
noted in Matsuda and Yajima [24] and [BL], the periodogram of irregularly 
spaced spatial data can be severely biased (for the spectral density) and 
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must be pre-processed. Finally, in contrast to the unidirectional flow of time 
that drives the asymptotics in the time series case, for irregularly spaced 
spatial data on an increasing domain, more than one possible asymptotic 
structure can arise depending on the relative growth rates of the volume 
of the sampling region and the sample size (cf. Cressie [7], Hall and Path 
[14], Lahiri [18]). A desirable property of any FDEL method for irregularly 
spaced spatial data would be to guarantee Wilks’ phenomenon for the spa¬ 
tial FDEL ratio statistic with minimal or no explicit adjustments for the 
different asymptotic regimes. This would ensure a sort of robustness prop¬ 
erty for the spatial FDEL and would allow the user to use the method in 
practice without having to explicitly tune it for the effects of different spatial 
asymptotic structures, which is often not very obvious for a given data set 
at hand (cf. Zhang and Zimmerman [40]). 

To motivate the construction of our spatial EDEL (hereafter SEDEL), 
first we briefly review some relevant results (cf. Section 2) that provide cru¬ 
cial insights into the properties of the DFT and periodogram of irregularly 
spaced spatial data under different spatial asymptotic structures. Our main 
result is the asymptotic chi-squared distribution of the SEDEL ratio statistic 
under fairly general regularity conditions on the underlying spatial process. 
However, it turns out that the spatial asymptotic structure has a nontrivial 
and nonstandard effect on the limit law. When the spatial sample size n 
grows at a rate comparable to the volume of the sampling region, we shall 
call this the pure increasing domain or PID asymptotic structure, while a 
faster growth rate of n (due to infilling) will be called the mixed increasing 
domain or MID asymptotic structure (see Section 2 for more details). To 
describe the peculiarity of the limit behavior of the SFDEL, let Tln{9o) de¬ 
note the SFDEL ratio statistic for a covariance parameter of interest 0 £ 
under Hq :6 = 6q based on a sample of size n. The main results of the paper 
show that under some regularity conditions, 

(1.1) _21og7^„(0o)^Xp 

under MID with a sufficiently fast rate of infilling. In contrast, under PID 
and under MID with a relatively slow rate of infilling, one gets 

( 1 . 2 ) -2lognn{9o)^2xl. 

Thus, the limit distribution of —21og7^n(0o) here changes from the more 
familiar Xp to a nonstandard 2Xp distribution, which points to the intricacies 
associated with spatial asymptotics. The main reason behind this strange 
behavior of the SEDEL ratio statistic is the differential growth rates of two 
components in the variance term of the DFTs of irregularly spaced spatial 
data, which alternate in their roles as the dominating term depending on 
the strength of the infill component. 
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To overcome the dichotomous limit behavior of —21og7?.„(0o) in (IT) 
and (1.2), we construct a data based scaling an = an(^o) (say) and show 
that the rescaled version, —a„21og7^„(0o) attains the same Xp limit, irre¬ 
spective of the underlying spatial asymptotic structure. This provides a uni¬ 
fied method for EL based inference on covariance parameters for irregularly 
spaced spatial data. In addition, the proposed SFDEL method accomplishes 
two major goals of the EL method of Owen [32, 33] for independent data: 

(i) it shares the strength of EL methods to incorporate automatic vari¬ 
ance estimation for spectral parameter inference in its mechanics under dif¬ 
ferent spatial asymptotic structures and, at the same time, 

(ii) it avoids the difficult issue of block length selection. 

A direct solution to either of these problems (i.e., explicit variance estimation 
and optimal block length selection) in the spatial domain is utterly difficult 
due to highly complex effects of the irregular spacings of the data sites 
and the spatial asymptotic structures (cf. Lahiri [18], Lahiri and Mukherjee 
[21]) and due to potentially nonstandard shapes of the sampling regions 
(Nordman and Lahiri [29], Nordman, Lahiri and Eridley [31]). Results from 
a simulation study in Section 6 show that accuracy of the SEDEL method 
with the data-based rescaling is very good even in moderate samples. 

The rest of the paper is organized as follows. In Section 2, we describe the 
theoretical framework and some preliminary results on the properties of the 
DFT for irregularly spaced spatial data from [BL] that play a crucial role in 
the formulation of the SFDEL method. We describe the SFDEL method in 
Section 3 and give some examples of useful spectral estimating equations. 
We state the regularity conditions and the main results of the paper in 
Sections 4 and Section 5, respectively. Results from a simulation study and 
an illustrative data example are given in Section 6. Proofs of the main results 
are presented in Section 7. Further details of the proofs and some additional 
simulation results are given in the supplementary material [3]. 

2. Preliminaries. 

2.1. Spatial sampling design. Suppose that for each n > 1 (where n de¬ 
notes the sample size), the spatial process Z{-) is observed at data locations 
si,...,s„ over a sampling region C M'^. We shall suppose that T>n is 
obtained by inflating a prototype set Vq by a scaling factor G [l,oo) as 

(2.1) Vn = Xn'Do, n>l, 

where (as the most relevant prototypical case) Pq is an open connected sub¬ 
set of (—1/2,1/2]'^ containing the origin and where t oo as n —?■ oo with 
\n S> for some e > 0. Note that this is a common formulation, allow¬ 
ing the sampling region Vn to have a variety of shapes, such as polygonal. 
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ellipsoidal and star-shaped regions that can be nonconvex. In practice, An 
can be determined by the diameter of a sampling region for use here (cf. 
Garcfa-Soidan [12], Hall and Path [14], Maity and Sherman [23], Matsuda 
and Yajima [24]). Let Z = {0, ±1, ±2,..To avoid pathological cases, we 
require that for any sequence of real numbers {an}n>i such that —)• 0+ as 
n —>• oo, the number of cubes of the form + [0,1)'^), j G that intersect 
both Vq and T>q is of the order 0([an]~^'^~^^) as n —>• oo. This boundary con¬ 
dition holds for most regions of practical interest. We also suppose that the 
irregularly spaced data locations si,..., Sji £ Vn are generated by a stochas¬ 
tic sampling design, as 

S 2 = Sin — An^^i^ 1 Y ^ ^ U, 

where {Xfc}fc>i is a sequence of independent and identically distributed 
(i.i.d.) random vectors with probability density /(x) with support cl.(Po)) 
the closure of Pq- Note that this formulation allows the number of sampling 
sites to grow at a different rate than the volume of the sampling region, 
leading to different asymptotic structures (cf. Cressie [7], Lahiri [18]). When 
n/An —)• c* € (0, 00 ), one gets the PID asymptotic structure while for n/A^ —>• 
00 as n —>■ 00 , one gets the MID asymptotic structure. Limit laws of common 
estimators are known to depend on the spatial asymptotic structure; see 
Cressie [7], Du, Zhang and Mandrekar [9], Lahiri and Mukherjee [21], Loh 
[22], Stein [37] and the references therein. 

2.2. Spatial periodogram and its properties. Define the DPT dn{uj) and 
the periodogram In{^) of {Z{si),... , Z(s„)} at a; £ as 

n 

( 2 . 2 ) dn{<^) = A'^‘^n~^'^Z{sj)ex.-p{iu'sj) and ^(a;) = |d„(a;)|^, 

i=i 

where i = y/—\. In an equi-spaced time series, formulation and properties 
of FDEL critically depend on the asymptotic independence of the DFTs 
(cf. Brockwell and Davis [6], Lahiri [19]) at the Fourier frequencies: Wj = 
2Trj/n,j = 1,..., n where n is the sample size. In a recent paper, [BL] showed 
that the spatial DFTs [in (2.2)] at two sequences of frequencies {u)in}n>i-, 
{^ 2 n}n>i C are asymptotically independent (i.e., the joint limit law is 
a product of marginal limits) if and only if the frequency sequences are 
asymptotically distant: 

(2.3) ||An(t<^ln - <^2n)|| ^00 aSU^OO. 

This suggests that in analogy to the time series FDEL (i.e., using that 
DFTs are approximately independent so that the independent data version 
of EL may be applied to resulting periodogram values), the formulation of 
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spatial FDEL should preferably be based on DFTs at a collection of fre¬ 
quencies that are well-separated. A second important finding in [BL] is that 
unlike the case of the equi-spaced time series data, the spatial periodogram 
/„(•) has a nontrivial bias, depending on the spatial asymptotic structure. 
In particular, [BL] shows that 

EIn{u) = [n-^Xia{0) + + o(l)) 

for all u £ M'^, where (t(-) and (/>(•) are respectively the autocovariance and 
the spectral density functions of the Z(-)-process and where K = x 

/^(u;) du;. As a result, the spatial periodogram /n(‘) has a nontrivial 
bias [for estimating K4>{-)] at all frequencies under BID, while the bias 
vanishes asymptotically under MID. However, the quality of estimation of 
the spectral density (up to the scaling by K) improves under both BID and 
MID through an explicit bias correction. Accordingly, we define the bias 
corrected periodogram 

(2.4) 4(u;)=4(^)-n-^A^<7„(0), u; G 

where (5'n(0) = ~ -^n)^ is the sample variance, with Zn = 

n~^ denoting the sample mean. We shall use In{') in our formu¬ 

lation of the SFDEL in the next section. 

3. The SFDEL method. 

3.1. Description of the method. For i.i.d. random variables, Qin and 
Lawless [35] extended the scope of Owen’s [33] original formulation, link¬ 
ing estimating equations and EL, and developed EL methodology for such 
parameters. In a recent work, Nordman and Lahiri [30] (hereafter referred to 
as [NL]) formulated a FDEL for inference on parameters of an equi-spaced 
time series dehned through spectral estimating equations (i.e., estimating 
equations in the frequency domain [—tt, tt]). In a similar spirit, we now define 
the SFDEL for parameters 0 G 0 C defined through spectral estimating 
equations (but now dehned over all of W^). Specihcally, let G x 0 —)• 

denote a vector of bounded estimating functions such that Gg{-) = G{-]9) 
satishes the spectral moment condition 

(3.1) / Gg{u)(f{u)du = 0, 

where recall that (p{-) denotes the spectral density of the process Z(-). Be¬ 
cause of their use in the SFDEL method to follow [cf. (3.3)], we refer to 
the functions Gg{u) as estimating functions, though these are not functions 
of data directly but rather of parameters 0 G 0 and frequencies a; G In 
view of the symmetry of the spectral density </>(•), without loss of gener¬ 
ality (w.l.g.), we shall assume that Gg{-) is symmetric about zero, that is, 
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G 0 {uj) = G 0 {—u) for all uj G An asymmetric G 0 {-) can always be sym¬ 
metrized, as in Example 2 of Section 3.2 below where we give examples of 
G 0 {-) in some important inference problems. 

The SFDEL dehnes a nonparametric likelihood for the parameter 9 us¬ 
ing a discretized sample version of the above spectral moment condition. 
Accordingly, for k G (0,1), rj G [k,oo) and G* G (0,oo), let 

(3.2) Af = Afn = {jA--:j G j G [-G*\l,G*Xlf} 

be the set of discrete frequencies, where is as in (2.1). Let N = \J\f\ 
be the size of M. Eor notational convenience, also denote the elements of 
J\f by k = 1,... ,N (with an arbitrary ordering of the N elements of 

Af). The frequency grid has two important qualities. First, since k <1, for 
any j / k, the sequences {cujn} and are asymptotically distant [cf. 

(2.3) ], guaranteeing their associated periodogram values are approximately 

independent. Further, {cji„,... ,u)isfn} forms a regular lattice over the hyper¬ 
cube [—C'*An~'^, C'*An~^]'^, with spacings of length A“^ in each direction, and 
[—C'*An~'^, 1 as A„ t oo when n —)• oo for 77 > k, covering the 

entire range of the integral in (3.1) in the limit. That is, the frequency grid 
expands to necessarily cover the entire frequency domain MA of interest. The 
exact conditions on k and i] are specified in Section 4 below. 

Now using the frequencies k = 1,...,A^}, we define the SFDEL 

function for 6 by 


(3.3) 


Cni9) =SUp 


N N 

'[\_Pk-'^Pk = ^,Pk > 0 and 
k=l k=l 


N 


'y ^ PfcG*g(CUfcy^)/^(CUfcy^) 


= 0 


k=l 


provided that the set of pk satisfying the conditions on the right-hand side is 
nonempty. When no such {pk} exists, Cn{9) is dehned to be 0. We note that 
the computation of (3.3) is the same as in EL formulations for independent 
data; see Owen [32, 34] and Qin and Lawless [35] for these details. 

Next, note that without the spectral moment constraint, Wk=iPk attains 
its maximum when each pk = 1/A^. Hence, we define the SFDEL ratio statis¬ 
tic for testing the hypothesis Hq :9 = 9o as 

nn{9o) = Cn{eo)/{N-^). 


The SFDEL test rejects Hq for small values of Tln{9o). Similarly, one can 
use the SFDEL method to construct confidence regions for 9 using the large 
sample distribution of the SFDEL ratio statistic. In Section 4, we state a set 
of regularity conditions that will be used for deriving the limit distribution of 
—21og7^„(0o)- This, in particular, would allow one to calibrate the SFDEL 
tests and confidence regions in large samples. 


SPATIAL EMPIRICAL LIKELIHOOD 


9 


3.2. Examples of estimating equations. We now give some examples of 
spectral estimating equations for parameters of interest in frequency domain 
analysis (cf. Brockwell and Davis [6], Cressie [7], Journel and Huijbregts [16], 
Lahiri, Lee and Cressie [20]). 


Example 1 (Autocorrelation). Suppose that we are interested in non- 
parametric estimation of the autocorrelation of the Z(-)-process at lags 
hi,...,hp for some p > 1. Then 6 = (^(hi),..., ^j(hp))' with ^>(h) = 
corr(Z(h), Z(0)) = f cos (h'u;)(/>(u;) dco/ f <f>{u)du where A' denotes the 
transpose of a matrix A. Thus, in this case, 

(3.4) G 0 {uj) = (cos (h)^^),... ,cos (hpO;))^ — 6 . 

Estimating functions can also be formulated with hypothesized autocor¬ 
relations (e.g., white noise) to set-up goodness-of-fit tests in the SFDEL 
approach, in the spirit of Portmanteau tests Brockwell and Davis [6] . 


Example 2 (Spectral distribution function). For t = (ti, ... ,td)' G 
let 



l(-oo,t](^)<(>(<^)c?<^ 



denote the normalized spectral distribution function, where !(•) denotes the 
indicator function and (—oo,t] = (—oo,ti] x ••• x {—oo,td]- The function 
‘h*^(-) plays an important role in determining the smoothness of the sample 
paths of the random held Z{-) (cf. Stein [37]). Suppose that the parameter of 
interest is now given hy 9 = (<I>*^(ti),..., <l>°(tp))' for some given set of vectors 
ti,...,tp £ M'^. In this case, the relevant estimating function is Gg(u:) = 
[Ge{u) -L Ge{—ijj)]/2, u £ where 

(3-5) Gg{u) = (l(-oo,ti](‘^)! ■ ■ ■ !l(-oo,tp](^))^ - 


Example 3 (Variogram model htting). A popular approach to htting a 
parametric variogram model to spatial data is through the method of least 
squares (cf. Cressie [7]). Let {2y{-;6):6 £ 0}, 0 C be a class of valid 
variogram models for the true variogram 27 (h) = Var(Z(h) — Z(0)), h £ 
of the spatial process. Let 2'y{-]6) = 2y{--,6)/a{0) and 27 (-) = 2y{-)/a{0) de¬ 
note their scale-invariant versions, where (t( 0) = Var(Z(0)). Also, let 27 „(h) 
denote the sample variogram at lag h based on Z(si),..., Z(s„) (cf. Chap¬ 
ter 2, Cressie [7]), scaled by (T„(0) = n~^ i^i) ~ ^n)^ where Zn = 

^”^1 Z(sj). Then one can £t the variogram model by estimating the 
parameter 6 by 

9n = argmin|^( 27 „(hj) - 2y{h.i-,9)f -.O £ cj 
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for a given set of lags hi,..., h^- This corresponds to minimizing the popula¬ 
tion criterion X]^i( 27 (hj) — 27 (hj; 0 ))^ which, under some mild conditions, 
determines the true parameter 9q uniquely (cf. Lahiri, Lee and Cressie [20]). 
Under these conditions, 9 = 6o is the unique solution to the equation 

m 

^(27(h,) - 27(h,; 0))V[27(h,; 0)] = 0, 

i=l 


where V[ 27 (h; 0)] denotes the p x 1 vector of first-order partial derivatives of 
27 (h; 0 ) with respect to 9. Hence, expressing the variogram in terms of the 
spectral density function, we get the following equivalent spectral estimating 
equation: 


(3.6) 


^{1 - cos(h'cj) - 7 (h; 6 ')}V[ 27 (hi; 6 »)] 


_ 2=1 


du = 0, 


which can be used for defining the SFDEL for 9. As pointed out in Sec¬ 
tion 1, the spatial domain approach yields asymptotically correct confidence 
regions for 9 through asymptotic normal distribution of 9^ but it necessar¬ 
ily requires one to estimate the limiting asymptotic variance and is subject 
to the curse of dimensionality, resulting from nonparametric smoothing in 
d-dimensions. In comparison, the SFDEL can be applied with the spectral 
estimating equation (3.6) to produce asymptotically correct confidence re¬ 
gion for 0, without explicit estimation of the standard error. 


Note that the spectral estimating equation approach can also be extended 
to estimation of 9 based on the weighted- and the generalized-least squares 
criteria (cf. Cressie [7], Lahiri, Lee and Cressie [20]), where in addition to 
the partial derivatives, suitable weight matrices enter into the corresponding 
versions of (3.6). A similar advantage of the spatial FDEL method continues 
to hold in these cases. 

In the next section, we introduce some notation and the regularity con¬ 
ditions to be used in the rest of the paper. 


4. Regularity conditions. 

4.1. Notation and lemmas. First, we introduce some notation. For x, y G 
M, let x+ = max{x,0}, [xj = the floor function of x, x Ay = min{x,y} and 
X V y = max{x, y}. Let Ik denote the identity matrix of order k (/c > 1). For 
two sequences {s^} and {tn} in (0, oo), we write ~ tn if lim^^oo Sn/tn = 1- 

For X = (xi,..., Xfc)' G M^, let ||x||i = [xil H-h |xfc| and ||x|| = (|xip-|-h 

\xk\^Y^‘^ respectively denote the and £^-norms of x. Also, let di(F'i, F 2 ) = 
inf{||x —s||i: x G Ei,s G F' 2 }, Fi, F 2 C M^. For a, 6 G (0, 00 ), define the strong 
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mixing coefficient of Z {-) as a(a; h) = sup{|P(74i n A^) — P{Ai)P{A 2 )\ : Ai G 
Pz{Pi),Ei £ Cb,i = l,2,di{Ei,E2) > a} where Ez{E) = a ( Z ( s ) :s G E) and 
Cft is the collection of d-dimensional rectangles with volume b or less. 

As indicated earlier, we suppose that the random field { Z ( s ) :s G is 
second-order stationary (but not necessarily strictly stationary) with zero 
mean and autocovariance function it(-) and spectral density function 
Also, recall that the scaling sequence Xn is as in (2.1) and that k, rj and N 
are as in Section 3.1, specifying the SFDEL grid in the frequency domain. 
Further, the constant c* = lim^^oou/A^ determines the spatial asymptotic 
structure where c* G (0,oo) for PID and c* = oo for MID. Write Cn = njX^, 
/*(a;) = In{<^) — c^^t(O) and An{u) = c~^a{0) + K4>{uj), co G where 
E = (27r)^ff^. Let = 2 Y2k=i Geo{^kn)Geoi^knyAl{^kn)- Also, let 
denote the jth component of Gg^. Write 6^ = + X'^'^. From Section 7, 

it follows that 6^ gives a unified representation for growth rate of the self- 
normalizing factor in the SFDEL ratio statistic under different asymptotic 
structures considered in the paper. 


4.2. Conditions. We are now ready to state the regularity conditions. 

(C.O) The strong mixing coefficient satisfies a{a,b) < 71 ( 0)72 (&), for any 
a,bG ( 0 , 00 ), with respect to some left continuous nonincreasing func¬ 
tion 71 : ( 0 , 00 ) ^[ 0 , 00 ) and some right continuous nondecreasing func¬ 
tion 72 : ( 0 , 00 ) —>-( 0 , 00 ). 

(C.l) There exist 6 G (0,1] such that ^ 4 - 1-5 = sup{(E|Z(s)|^+‘^)^A4+<5) .g g 
< 00 and k^^[ji{k)]GA+^) < 00 . 

(C.2) (i) The spatial sampling density /(•) is everywhere positive on Vq and 
satisfies a Lipschitz condition: There exists a Cq G (0,oo) such that 

l/(x) - /(y)| < Co||x-y|| for all x,y gPq- 


(ii) There exist Gi G (0,oo) and oq G {d/2,d] such that 


P^^f{x)dx 


-L 


g.a;'xy2(x)^X 


<C'i||o;| 


—ao 


for all ||cj|| > Cl. 

(C.3) (i) For each j = Gj^g^y) is bounded, symmetric, and almost 

everywhere continuous on (with respect to the Lebesgue measure), 
and f Ggg(uj)(/>(u;) du; = 0; 

(ii) There exist C 2 G (0, 00 ) and a nonincreasing function h: [0, 00 ) —>■ 
[ 0 ,oo) such that \4>{<jo)\ </i(||a;||) for all ||a;|| >C 2 ; 

(hi) hminf„^oodet(A^“^X]^=iG'eo(^fcn)G 6 )o(a;fc„)') > 0 ; 

(iv) I Geo {u})GgQ{u)'(lP‘{u) du is nonsingular. 

(C.4) (i) 0 < At < 7 < 1; and 
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(ii) 

(C.5)' For each n > 1, there exists a function Mn{-) such that 


N 

Geo {^kn)Geo {^^kn)' exp(it'cjfc„) 

k=l 


< Mn{t) 


for all t G 


and with dv{t,'x) = ||t|| 7 i(||t||)^/(^"''^^/(x) dtdx. and 6 G (0,1] of (C.l), 


Mn{t + ai [s + 2 A„a 2 X + 2 A„a 3 y]) dv{t, x) dv{s, y) 
= A^+“i) for all oi, 02,03 G {0,1}. 


We comment on the conditions. Conditions (C.O)-(C.l) are standard mo¬ 
ment and mixing conditions on the spatial process Z{-) (cf. Lahiri [18]), 
which entail that Z{-) must be weakly dependent and are used to ensure 
finiteness of the variance of the periodogram values [which are themselves 
quadratic functions of ■Z’(s)], among other things. See Doukhan [8] for process 
examples fulhlling such conditions, including Gaussian, linear and Markov 
random fields. Also, note that the function 72 (-) in (C.O) is allowed to grow to 
infinity to ensure validity of the results for bonafide strongly mixing random 
fields in d > 2 (cf. Bradley [4, 5]). Condition (C.2) specifies the requirements 
on the spatial design density /. Part (i) of (C.2) is a smoothness condition 
on / while part (ii) requires the characteristic functions corresponding to 
the probability densities /(•) and f to decay at the rate 0(||a;||““°) 

as ||a;|| —>-oo. Condition (C.2) is satisfied [with oq = d in (ii)] when /(•) is the 
uniform distribution on a rectangle of the form (—si,ti) x • • • x (—Sd,trf) for 
some 0 < Si,ti < 1/2 for all i = 1,... ,d. However, there exist many nonuni¬ 
form densities that also satisfy (C.2) with oq = d. 

Condition (C.3) specifies the regularity conditions on the spectral esti¬ 
mating function Geo- In addition to the spectral moment condition (3.1), 
parts (i) and (ii) of (C.3) provide sufficient conditions that make the errors 
of Riemann sum approximations to the variance integral f Geo(^)Geo(<^)' x 
(f){ijj)d(jJ asymptotically negligible. Conditions (C.3)(iii) and (iv) provide al¬ 
ternative forms of a sufficient condition that guarantees nonsingularity of the 
pxp matrix through a subsequence under PID and for the full sequence 
under (a subcase of) the MID asymptotic structure, respectively. Without 
these, the degrees of freedom of the limiting chi-squared distribution of the 
scaled log-SFDEL ratio statistic can be smaller than p. It is easy to verify 
that the examples presented in Section 3 satisfy condition (C.3), under mild 
conditions on the hj’s in Example 1, on the tj’s in Example 2, and on the 
hj’s and the parametric variogram model 2'y{-]6) in Example 3. 

Next, consider condition (C.4). The hrst part of (C.4) states the require¬ 
ments on the SEDEL tuning parameters k and rj that must be chosen by 
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the user in practice. Note that k and rj determine a Riemann-sum approx¬ 
imation to the spectral moment condition (3.1) over the discrete grid (3.2) 
where k determines the grid spacing while r] determines the range of the 
approximating set [—C'*An~”^, Thus, one must choose these pa¬ 
rameters so that the grid spacing is small and the integral of out¬ 
side is small. On the other end, k needs to satisfy 

the requirement 0 < «: < 1 to ensure that the neighboring frequencies in 
J\f are “asymptotically distant.” Section 6 gives some specific examples 
of the choices of k and r] in hnite sample applications. As for condition 
(C.4)(ii), note that the “asymptotically distant” property of the frequencies 
in Af renders the summands in J2k=i^doi'^kn)In{<^kn) approximately in¬ 
dependent, and hence, under suitable normalization, the sum must have a 
Gaussian limit. One set of sufficient conditions for the weak convergence of 

T,k=iGeoi^kn)[In{^kn) - EI*{ukn)] to a Gaussian limit is given by 
a CLT result in Bandyopadhyay, Lahiri and Nordman [2] (hereafter referred 
to as [BLN]). Alternative sufficient conditions for the GLT in (G.4)(ii) can 
also be derived requiring that Z{-) is a d-dimensional linear process, but we 
do not make any such structural assumptions on Z{-) here. 

Finally, consider condition (C.5)' which will be used only in the MID case 
(cf. Theorems 5.2 and 5.3). This condition can be verified easily when the 
Fourier transform ^j^k (say) of the function Gjfi^Gkfio decays quickly, for all 
^ < j-,k <P- In contrast, if the functions P^j^k do not decay fast enough, one 
can verify (C.5)' using Lemma 7.4 and the arguments in the proof of the 
result below, which shows that condition (C.5)' holds for Examples 1-3. 

Proposition 4.1. For Ge{-) of Examples 1-3, condition (C.5)' holds. 

The next section states the main results of the paper under PID and MID. 

5. Asymptotic distribution of the spatial FDEL ratio statistic. 

5.1. Results under the PID asymptotic structure. Let Px denote the 
joint distribution of the random vectors Xi, X 2 ,..., generating the locations 
of the data sites (cf. Section 2.1). The following result gives the asymptotic 
distribution of the SFDEL ratio statistic under PID. 

Theorem 5.1. Suppose that conditions (C.0)-(G.4) and that c^, G 

(0,oo). Then-loglZnido)xl,a.s. (Py^). 

Theorem 5.1 shows that under conditions (G.0)"(C.4) [and without re¬ 
quiring (G.5)'], the SPDEL log-likelihood ratio statistic has an asymptotic 
chi-squared distribution, for almost all realizations of the sampling design 
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vectors {X,}. Note that the scaling for the log SFDEL ratio statistic is 
nonstandard—namely, the chi-squared limit distribution is attained by 
— XogTZniOo), but not by the more familiar form —21og7?.„(0o) as in Wilks’ 
theorem and as in the time series FDEL case (cf. Nordman and Lahiri [30]). 
This is a consequence of the nonstandard behavior of the periodogram for 
irregularly spaced spatial data (cf. Section 3). However, as the limit dis¬ 
tribution of the SEDEL ratio statistic does not depend on any unknown 
population quantities, it can be used to construct valid large sample tests 
and confidence regions for the spectral parameter 0. Specihcally, a valid large 
sample level a G (0,1/2) SEDEL test for testing 

(5.1) Ho:e = eo vs 

will reject Hq if —logTln{Oo) > where Xi-a,p denotes the (1 — a) 

quantile of the Xp-distribution. Eor SEDEL based conhdence regions for 6 , 
a similar distribution-free calibration holds (cf. Section 5.3). 

Remark 5.1. Note that the distribution of Tln{0o) depends on two 
sources of randomness, namely, the spatial process {Z[s) :s G and the 
vectors {Xj}j>i. Let C{T\X) denote the conditional distribution of a ran¬ 
dom variable (based on both {Z{-)} and {Xj}), given X = (t(Xi, X 2 ,...) 
and let di denote the Levy metric on the set of probability distributions 
on M. Then a more precise statement of the Theorem 5.1 result, under the 
conditions given there, is 

(iL(£(-log7^„(6»o)|T’),Xp) = 0(1) a.s. (Px)- 
A similar interpretation applies to the other theorems presented in the paper. 

5.2. Results under the MID asymptotic structure. The limit behavior of 
the spatial FDEL ratio statistic under the MID asymptotic structure shows a 
more complex pattern and it depends on the strength of the inhll component. 
Note that Cn = n/Xf^ denotes the relative growth rate of the sample size and 
the volume of the sampling region of Dn, and hence, 00 as n —)• 00 under 

MID, with a higher the value of Cn indicating a higher rate of inhlling. The 
following result gives the asymptotic behavior of the SFDEL ratio statistic 
under different growth rates of Cn- 

Theorem 5.2. Suppose that conditions (C.0)-(C.4) and (0.5)' hold 
[where (C.3) may be replaced by (C.3)(i), (ii), (iv) for part (b)/. 

(a) (MID WITH A SLOW RATE OF INFILLING). // 1 < then 

- logP„(6lo) Xp, as- /Px/- 

(b) (MID WITH A FAST RATE OF INFILLING). If > N , 
-21ogPn(6'o)Xp, a-s- r^x/- 


then 
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Theorem 5.2 shows that the asymptotic distribution of — log7?.„(0o) can 
be different depending on the rate at which the infilling factor c„ goes to 
infinity. When the rate of decay in is slower than the critical rate ~ 

Xn corresponding to the asymptotic volume of the frequency grid (i.e., 
determined by the number N oc XfP of frequency points on a regular grid 
and the M'^-volume X~'^^ between grid points), the negative log SFDEL ratio 
has the same limit distribution as in the PID case. However, when the factor 

grows at a faster rate than Xn the more familiar version of scaling 
—2 is appropriate for the log SFDEL ratio. From the proof of Theorem 5.2, 
it also follows that in the boundary case, that is, when ~ X^ the 
limit distribution of — log7?.„(0o) is determined by that of a quadratic form 
in independent Gaussian random variables and is not distribution-free. As 
a result, this case is not of much interest from an applications point of view. 
However, as = u/A^ is a known factor, one can always choose the SFDEL 
tuning parameters k, rj to avoid the boundary case. 

Remark 5.2. Theorem 5.2 shows that when the rate of infilling c„ does 
not grow too fast, the presence of the infill component does not have an 
impact on the asymptotic distribution of the log SFDEL ratio statistic. 
Thus, the limit behavior under the PID asymptotic structure has a sort 
of robustness that extends beyond its realm and covers parts of the MID 
asymptotic structure in the frequency domain. This is very much different 
from the known results on the limit distributions of the sample mean and of 
asymptotically linear statistics in the spatial domain where all subcases of 
the MID asymptotic structure lead to the same limit distribution and where 
the MID limit is different from the limit distribution in the PID case (cf. 
Lahiri [18], Lahiri and Mukherjee [21]). 

5.3. A unified scaled spatial FDEL method. Results of Sections 5.1 and 
5.2 show that in the spatial case, the standard calibration of the EL ra¬ 
tio statistic may be incorrect depending on the relative rate of infilling. 
Although nonstandard, —21og7?.„(0o) has the same 2Xp distribution under 
the PID spatial asymptotic structure for all values of c* = lim„_).oo n/A[(. In 
contrast, the limit distribution of —21og7^n(^o) can change from the non¬ 
standard 2Xp to the standard Xp under the MID asymptotic structure when 
the rate of inhlling is faster. While this gives rise to a clear dichotomy in 
the limit, the choice of the correct scaling constant, and hence, the correct 
calibration may not be obvious in a hnite sample application. To deal with 
this problem, we develop a data based scaling factor that adjusts itself to 
the relative rates of infilling and delivers a unihed Xp limit law under the 
PID as well as under the different subcases of the MID. Specifically, define 
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the modified FDEL statistic 


-2an{0)logTZn{0), 


where 

Shi 

Ef.i 


Note that for any 0, the factor an{0) can be computed using the data 
{Z(si),..., Z(s„)}, where the numerator of an{0) is computed using the 
bias-corrected periodogram while the denominator is based on the raw peri- 
odogram. For the testing problem Hq -.9 = 9q against Hi:9^9o, this requires 
computing the factor an{-) once. However, for constructing conhdence inter¬ 
vals, an{9) must be computed repeatedly and, therefore, this version of the 
SFDEL is somewhat more computationally intensive. 

To gain some insight into the choice of an{9), note that it is based on the 
ratio of the sums of the periodogram and its bias-corrected version that are 
weighted by the squared norms of the function Gg{-) at the respective fre¬ 
quencies ujkn- As explained before, the bias correction of the periodogram of 
irregularly spaced spatial data is needed to render the EL-moment condition 
in (3.3) unbiased. However, this leads to a “mismatch” between the variance 
of the sum Ylk=i^si^kn)Ini^kn) and the automatic scale adjustment fac¬ 
tor provided by the EL method. The numerator and the denominator of 
an{9) capture the effects of this mismatch under different rates of inhlling 
and hence, an{-) provides the “correct” scaling constant under the different 
asymptotic regimes considered here. 

We have the following result on the modified SFDEL ratio statistic. 


Theorem 5.3. Suppose that the conditions of one of Theorems 5.1-5.2 
hold. Then, under 9 = 9q, 

(5.3) - 2a„(6'o)log7^„,(6»o) Yp a.s. (P^). 

Theorem 5.3 shows that the modified SFDEL method can be calibrated 
using the quantiles of the chi-squared distribution with p degrees of freedom 
for all of the three asymptotic regimes covered by Theorems 5.1-5.2. Thus, 
the empirically scaled log-SFDEL ratio statistic provides a unified way of 
testing and constructing confidence sets under different asymptotic regimes. 
Specifically, for any a € (0,1/2), 

= {0 G 0 : -2an{9) log7^„(0) < xLaJ 

gives a conhdence region for the unknown parameter 9 that attains the 
nominal conhdence level (1 — a) asymptotically. The main advantage of 
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the SFDEL method here is that we do not need to find a studentizing 
covariance matrix estimator explicitly, which by itself is a nontrivial task, 
as this would require explicit estimation of the spectral density 4>{-) and the 
spatial sampling density /(•) under different asymptotic regimes. 

6. Numerical results. 

6.1. Results from a simulation study. Here, we examine the coverage 
accuracy of the SFDEL method in finite samples, applied to a problem of 
variogram model fitting described in Section 3.2. We consider an exponential 
variogram model form (up to variance normalization) 

27 (h; 6 'i, 6 ' 2 ) = 1 - exp[-6»i|/ii| - 02 \h 2 \], 

with parameters 0 i,02 > 0 where h = {hi,112)' G Over several sampling 
region sizes 1/2,1/2)^, An = 12,24,48, and sample sizes n = 100, 

400,900,1400, we generated i.i.d. sampling sites si,...,s„ G Pn and real¬ 
valued stationary Gaussian responses Z{-) following the exponential vari¬ 
ogram form with 0i = 02 = 1 and EZ{s) = 0, Var[Z(s)] = 1 (the simulation 
results are invariant here to values for the mean and variance). In the spatial 
sampling design (cf. Section 2.1), two distributions / for sites were consid¬ 
ered, one being uniform over T>q and the other being a mixture of two bivari¬ 
ate normal distributions 0.5AI((0,0)',l2) + 0.5N((l/4,1/4)', 2 I 2 ), truncated 
outside Vq, where I 2 denotes a 2 x 2 identity matrix. 

In implementing the modified SFDEL method to compute 90% confi¬ 
dence regions for 6 = {01,02), we used the estimating functions in (3.6) over 
m = 2 sets of lags hi, h 2 G and evaluated the (sample mean-centered) pe- 
riodogram at scaled frequencies Mn = :j G n [—C'*A„, C'*An]^}; we 

varied values C* = 1,2,4 (with r] = l held fixed) and k = 0.05,0.1,0.2 along 
with considering different combinations of lags hi, h 2 . Recall that C* and 
K respectively control the number and spacing of periodogram ordinates, 
where choices of k here roughly induce spacings between frequencies of 1, 
0.75 or 0.5 in horizontal/vertical directions; in our findings, these spacings 
were adequate whereas tighter spacings (e.g., k > 0.4) tended to perform 
less well by inducing stronger dependence between periodogram ordinates. 

The coverage results (based on 1000 simulation runs) are listed in Tables 
1-2 for the lag hi = (1,1)',h 2 = (1,-1)' for the uniform and nonuniform 
spatial sites, respectively, with the results for the other sets of lags reported 
in the supplementary material [3] . Except for the occasions with the smallest 
lag combination [hi = (1,1)', h 2 = (1, —1)^] and the smallest sampling region 
An = 12 with large n, the coverages tended to agree quite well with the 
nominal level. Further, the coverage levels were largely insensitive to the 
number and spacing of periodogram ordinates for various sample and region 
sizes. Results for both stochastic sampling designs were also qualitatively 
similar. 
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Table 1 

Coverage percentage of 90% SFDEL regions for variogram model parameters 9 (uniform 

design) 


hi = (l,l)',h2 = (l,-l)' 


c* 

K 


\n 

= 12 




= 24 




= 48 


100 

400 

900 

1400 

100 

400 

900 

1400 

100 

400 

900 

1400 

1 

0.05 

86.4 

85.6 

82.0 

80.3 

88.9 

87.8 

87.8 

89.9 

89.3 

89.4 

89.7 

87.9 

1 

0.1 

87.1 

85.3 

78.6 

75.9 

89.0 

90.2 

89.6 

90.4 

89.0 

91.4 

91.5 

90.0 

1 

0.2 

86.5 

85.1 

81.1 

76.4 

90.0 

88.7 

90.1 

89.7 

87.6 

87.9 

87.9 

88.9 

2 

0.05 

88.1 

87.8 

86.1 

85.9 

89.0 

88.6 

89.7 

87.9 

89.2 

88.9 

90.5 

89.7 

2 

0.1 

86.6 

86.8 

86.2 

84.2 

89.2 

88.4 

91.1 

89.9 

90.6 

90.0 

90.0 

91.4 

2 

0.2 

89.6 

88.8 

84.6 

83.8 

88.9 

89.9 

89.9 

89.2 

89.9 

89.3 

88.1 

89.4 

4 

0.05 

89.0 

87.8 

89.6 

88.1 

89.3 

89.0 

90.1 

90.2 

92.9 

88.2 

90.6 

89.9 

4 

0.1 

86.3 

88.6 

88.7 

86.4 

90.3 

89.4 

90.3 

89.2 

92.0 

87.8 

90.8 

89.1 

4 

0.2 

88.4 

89.0 

87.4 

87.9 

88.7 

88.9 

90.0 

89.6 

92.8 

88.6 

88.5 

88.8 


6.2. An illustrative data example. As a brief demonstration of the SFDEL 
method, we consider a coal seam dataset based on a SAS example ([36], 
Chapter 70). Figure 1 shows locations of 105 sampling sites and the cor¬ 
responding distribution of coal seam thickness. Coal seam measurements 
often exhibit spatial smoothness (Journel and Huijbregts [16], page 165), 
as also indicated in the empirical semivariogram in Figure 1 (found by 
binning distances into 10 bins up to half the maximum distance between 
points and plotting Matheron’s average over each bin against the bin mid- 


Table 2 

Coverage percentage of 90% SFDEL regions for variogram model parameters 9 

(nonuniform design) 


hl = (l,l)',h2 = (l,-l)' 


C * 

K 


An 

= 12 



An 

= 24 



An 

= 48 


100 

400 

900 

1400 

100 

400 

900 

1400 

100 

400 

900 

1400 

1 

0.05 

88.3 

86.8 

85.5 

79.6 

89.4 

88.9 

86.4 

90.1 

90.2 

89.2 

89.6 

90.0 

1 

0.10 

85.7 

83.5 

80.3 

78.7 

88.8 

87.7 

89.6 

92.0 

87.0 

90.5 

89.5 

89.3 

1 

0.20 

87.9 

86.0 

82.4 

79.1 

89.6 

90.0 

90.7 

90.0 

87.4 

88.2 

88.8 

88.7 

2 

0.05 

89.4 

89.3 

88.0 

83.8 

90.1 

88.6 

89.7 

88.9 

89.6 

90.7 

89.5 

91.2 

2 

0.10 

86.2 

87.7 

84.3 

85.9 

89.0 

90.7 

90.0 

88.4 

90.1 

91.5 

90.1 

90.0 

2 

0.20 

88.7 

89.5 

88.5 

85.6 

90.7 

90.4 

89.7 

88.5 

89.7 

88.5 

90.3 

89.8 

4 

0.05 

89.5 

89.5 

88.3 

88.0 

87.7 

90.0 

88.6 

90.7 

91.8 

89.8 

89.2 

90.9 

4 

0.10 

87.0 

88.8 

87.4 

86.2 

89.0 

89.9 

87.9 

89.4 

91.7 

87.9 

88.2 

89.9 

4 

0.20 

90.6 

89.1 

89.1 

86.3 

89.5 

89.0 

87.9 

89.4 

91.5 

90.2 

89.3 

90.1 
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Easting (10,000 ft increments) Coai Seam Thickness (in ft) Distance (10,000 ft increments) 


Fig . 1 . Coal seam data: Sampling locations, distribution of thickness, empirical semivar- 
iogram. 

point). Following the SAS analysis, this suggests a Gaussian variogram 
model 27 (h; 61,62) = 26 i[l — exp(—||h|p/ 02 )], h £ with scale 6*1 > 0 and 
range 02 > 0 parameters, though the present data are synthetic with a value 
02 = 1 as explained below. 

Note that the spatial locations are not clearly uniform nor is the marginal 
distribution apparently normal. To ht the variogram model in a way that 
allows nonparametric confidence intervals (CIs) to access the precision of 
the estimated parameters, without making assumptions about the joint dis¬ 
tribution of the data or the distribution of spatial locations, one can apply 
the SFDEL method using estimating functions as in Example 3 motivated 
by least squares estimation. Alternatively, one can apply a kernel bandwidth 
estimator of the varigoram for which large sample distributional results are 
recently known (Garcia-Soidan [12], Maity and Sherman [23]). 

We focus on the range parameter 02. Using a lag set hi = (1/4,1/4)', 
h 2 = (1,1)', h 3 = (2,2)' in SEDEL, motivated by empirical lags in Eig- 
ure 1, the maximized SEDEL function produces a point estimate 62 = 1.123 
(xlOjOOO ft) with a 90% SEDEL GI for 62 as (0.896,1.571). This arises 
from a frequency grid : [—C*\n, C'*An]^ H 1?}, C* = 2, k = 0.2 based 

on An = 10 for sampling region in Eigure 1. With a larger frequency grid 
C* = 4, K = 0.2, the 90% SEDEL GI is similar (0.887,1.378) with a point es¬ 
timate 1.071, and increasing the grid spacing k = 0.1 produces similar range 
estimates (1.107 and 1.101 for C* = 2,4) and intervals. In contrast, using the 
lags above and the Nadaraya-Watson kernel estimator of the semivariogram 
7(h) (based on the Epanechnikov kernel, cf. Garcia-Soidan [12]), the range 
parameter estimates are 1.505,1.230,1.335 for bandwidths /i = 0.5,1,1.5, 
where h = 0.5 arises from MSE optimal order considerations. This approach 
can also produce large-sample nonparametric GIs based on normal limits for 
An[7(h) — 7(h)], having a covariance matrix C -V, C = [J f /^, involv¬ 
ing the unknown density / of locations {sj/An}l£^ on [0,1]^. Eor bandwidths 
h = 0.5,1,1.5, the 90% GIs for 02 are given by (1.203,1.743), (0.972,1.512), 
(1.038,1.564) based on C = 1.23 from bivariate kernel density estimation 














20 


S. BANDYOPADHYAY, S. N. LAHIRI AND D. J. NORDMAN 


(cf. Venables and Ripley [38]) and simplifying the matrix V by assuming 
the process is Gaussian (cf. Garci'a-Soidan [12], pages 490-491). Unlike CIs 
from kernel estimation, the SFDEL CIs require no variance or density ap¬ 
proximation steps, tend to be less sensitive to tuning parameters, and all 
contain the true value 02 = 1 here. 

To provide some assessment of the Cl methods, we conducted a small 
simulation study generating marginally standard normal variates {Y (si)}l^£^ 
with correlation corr[y(s), V(s-|- h)] = exp(— ||h||^) at the locations 
in Figure 1 and defining observations {Z{si) = — 1] +40.23}j^£^ 

from a spatial process having a Gaussian variogram as above with scale 6i = 
7.5 and range 02 = 1; this data-generation approximately matches features in 
the original SAS coal seam data and also produced the data example above. 
Based on 1000 simulations, 90% CIs for the range parameter 02 from the 
SFDEL method had coverages 90.5,87.4,93.3 for G* = 2 and 89.3,88.7,86.4 
for C* =4, over grid spacings k = 0.05,0.1,0.2. In contrast, 90% CIs for 02 
from the kernel estimation approach had actual coverages 68.4,74.4,52.7 for 
bandwidths h = 0.5,1,1.5. 

7 . Proofs of the results. 

7.1. Notation and lemmas. Define the bias corrected periodogram 
In{^) = In{^) — Xfi^aniO) and its (unobservable) variant I^i^) = In{^) — 
n“^A£(T(0). Recall that A„(a;) = c“^it(0) -|- K ( p { u :), u £ where K = 
(27r)'^ J/^. For notational simplicity, for a random quantity T depending 
on both {Z(s):s£M'^} and {Xi,X 2 ,...}, ET will denote the conditional 
expectation of T given X= {Xi,X 2 ,...} and likewise P will denote condi¬ 
tional probability. Thus, in the following, P(—21og77„(0o) < t) in fact refers 
to 


P(-21og77„(0o) < t|X), t>0. 

Also, write Px and Ex to denote the probability and the expectation under 
the joint distribution of Xi,X 2 ,.... Further, let C or G(-) denote generic 
constants that depend on their arguments (if any), but do not depend on n 
or the {Xj}. 

We now provide some lemmas that will be used for proving the main re¬ 
sults of the paper. Proofs of the lemmas and Proposition 4.1 are relegated to 
the supplementary material [3] to save space. For continuity, supplementary 
material [3] begins with three technical lemmas (Lemmas 7.1-7.3), providing 
some general cumulant and integral inequalities as well as the bias and the 
variance of the periodogram Ini') that are used to establish Lemmas 7.4-7.7 
below; these results may also be of independent interest. As presented next, 
Lemmas 7.4-7.7 deal with various properties and sums of the periodogram 
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that we will need to analyze the asymptotic behavior of the SFDEL ratio 
statistic under different asymptotic structures and establish the main results 
in Section 7.2. 


Lemma 7.4. Under conditions (C.0)-(C.3) and (C.5)' 

N 

E IE Gkn)G 


Lfc=l 


N 


■ ‘^'y^^GeQ{^^kn)G0fj{uJkny ^ni^kn) + o(^n) 


k=l 


Lemma 7.5. Under conditions (C.0)~(C.3) and (C.5)', 

N 

Geo {^in)GeQ {u}in)'[in{^in) — i^ni^in) + in))] 

i=l 

= Op{bl) a.s. (P^). 

Lemma 7.6. Under conditions (C.0)-(C.3) and (C.5)', for any s > 0, 
P(maxi<fc<7v \\Gg^{ukn)In{i^kn)\\ > ^K) = o(l), a.S. (Px) ■ 

Lemma 7.7. Let ch°{B) denote the interior of the convex hull of a set 
B CM^. Under conditions (C.0)“(C.3) and (C.5)', it holds that, as n—)-oo, 
P(0 e ch°{Ge,{u:kn)in{u^kn)}ti) ^ 1 a.s. (Px). 

7.2. Proofs of the main results. We now present the proofs of the re¬ 
sults from Section 5. In the following, references to the equations from the 
supplementary material [3] are given as (S.*). 


Proof of Theorem 5.1. By Lemma 7.7, 77.„,(0o) exists and is positive 
on a set with probability tending to one, a.s. (Px)- When 77„(0o) > 0 holds, 
by a general and standard EL result based on Lagrange multipliers (cf. Owen 
[33], page 100), one can express Tln{Go) as 

N 

(7.1) nn{eo) = ll{i + ik)-\ 

k=l 

where Idg^ = satisfies Pn(6'o,/30o) = 0 for 


N 


P„(0,/3) = iV-i^- 


Gg {u)kn)kn) 


k^l 1 + f3'Gg{uJkn)In{^kn) 





22 


S. BANDYOPADHYAY, S. N. LAHIRI AND D. J. NORDMAN 


and where 7 ^ = '^k,n = l3g^GeQ{ukn)in{^kn) satisfies | 7 fc| < 1 for all 1 < 
k < N. To prove the theorem, it is enough to show that given any sub¬ 
sequence {rii}, there exists a further subsequence {n^} of {uj} such that 

— logTZni^iOo) -A Xp- We use this line of argument because, as the proof indi¬ 
cates, the asymptotic expansion of —logTln^{9o) involves mean-like quanti¬ 
ties (e.g., term in the following) which may have differing (normal) limit 
distributions along different subsequences of {uk}', nevertheless, the log-ratio 
statistic — log7^n(^o) is shown to have a single, well-defined chi-square limit. 

Note first that, under the PID structure here, it follows immediately from 
(C.3) [cf. (S.IO)] that 


(7.2) 6 ^ ~ Nc^ ^ and 


N 

Sn - 2 C“^ [<7(0)]^ ^ ((^fcn)G'0o i^knY 

k=l 


o{N) 


[which is applied to show (7.4) from (7.3) next]. Fix a subsequence {nj}. 
Then by (C.3)(iii) and the fact that ||G 6 )p(a;)|| < C for all u £ it follows 
that there exist a subsequence {n^} of {rii} and a nonsingular matrix T* 
(possibly depending on {n^}) such that 


N 

(7.3) N~^J2G9o{^jn)G0o{ujny^T* through {nfc}. 

f=i 

For simplicity, replace the subscript by k and set , Uj = > 

7i = 7 i,n, and = Peo,n^- Also, let Wk = N-^ 

and Jfc = N~^Yfj^iGeoY^kn)ik{^j)- Then by (7.2), (7.3), condition (C.3) 
and Lemmas 7.5-7.6, we have, a.s. (Px), 

(7.4) \Jk\ = Op{N-^/^) and \\Wk-{2Nk)-^T.k\\= Op{l) 

as A:—)-oo. Since —>• 2 [(t( 0 )]^cF^F*, Wk 'is nonsingular whenever ||P4 — 

[(t( 0 )]^cF^F*|| is sufficiently small. 


Claim, \\l5e, || = Op{N^ a.s. (P^). 


Proof. Write =toUo where jjuo 

^=\\Fk{eo,M\\>KFk{eoM)\ 



^o['^Ge^{u:j)ik{u:j) -toY^ 
\j=i j=i 


= 1 and to = 11/300II- Then 


Gg,{uj)Ik{ujj) u[) Ggg (ujj ) I 

l + tou'(^Geo{ujj)in{ujj) 



^ tpu'QWkUo 
1 -|- toYk 


^l^j Jfcl, 


f=i 
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where Yfc = maxi<j<7Vfc ||G6»o(^i)lll-^A:(^i)l ei,...,er denote the stan¬ 

dard basis of ffi'’, with e* G MJ’ having a 1 in the ith position and 0 else- 
where. By Lemma 7.6, = Op{Np, ). Also, using (7.4), one can conclude 

that UQWfcUo >cro-|-Op(l) and hence, {l + tQYk)~HQ = a.s. (Px), 

where Uq > 0 is the smallest eigenvalue of [cj(0)]^c“^r*. Hence, it follows that 
to = \\Peo\\= Opi^k )> proving the claim. □ 

By the Claim and Lemma 7.6, 

(7.5) max | 7 j| < W^Yk = Op{N~^^‘^)op{Nl^‘^) = Op{l) a.s. (Px)- 

^<J<Nk 

Next, we obtain a stochastic approximation to j3eQ- Using Fk{6o,/3eo) = 0, 
note that 


Nk 


0 = V- ; 

^ ^i + d' 


Geo{<^j)h{<^j) 


^ 1 + /^eo J) 


Nk 


Gqo i^j ) i^j 
i=i 


1 - 7i + 


7| 


l + 7 i. 


i=i 


l+ 7 i 


Therefore, we have the representation 

( 7 . 6 ) l3eo = {Wk)-^Jk + Vk, 

where, using condition (C.3), Lemma 7 . 5 , the Claim, and ( 7 . 5 ), H^fcll < 

U|l&„PI|iri||->{Wt‘E,'iil|G».K)IP4"K)}{maxia<„.(l - |7,j|)->) = 

‘)Op(l)Oj(l)0,(l) =Oj(W;‘G), a.s. (Px). For ||&J|n < 1, 
applying a Taylor series expansion, we have 

log (1 -L 7 j) = 7 j - 7^/2 -L Aj, 

where |Aj| < \\/3gJ^Yk\\GBo{oj'j)\\‘^il{ujj){l - \\/3g^\\Yk)~^ for all 1 < j < Nk- 
Also, by Lemmas 7 . 4 - 7 . 5 , (C.3) and ( 7 . 4 ), NkJk{Wk)~^Jk “^Xp 

Nk f 1 

^|A,|<Afc||/30j|3n(l-||/30j|yfc)-3 iV-i^||G,3(^,)f4'(^i) 

j=i { j=i } 

= NkOp{N;^^\piNlG)Op{l)Op{l) = Op{l), 
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a.s. (-Px)' Hence, it follows that 


Nk 

-logP„J(9o) = -logPfc(6»o) = ^log(l + 7j) 

i=i 

-Nk Nk 

lj=i j=i 


.7j 


Nk 

i=i 




4[iVfcJfc] + ^A, 

i=i 


= 2-^Nk4{Wk)-^Jk + Op(l) A 

This completes the proof of Theorem 5.1. □ 


Proof of Theorem 5.2. By conditions on Cn, N and Xn in the MID 
case of part (a), 


and 

k=l 

where c~^ = o(l). Thus, hn has a slower growth rate in this case compared 
to the PID case. As in the proof of Theorem 5.1, it is enough to show that 

— logTZuki^o) Xp through some subsequence {n^} of a given subsequence 
{rii}. Indeed, the subsequence {n^} is extracted using (C.3)(iii) as before so 
that (7.3) holds. Let Y^, and 7 j be as defined in the proof of Theorem 5.1, 
and here we continue to use the convention that the subscript Uk is replaced 
by k, as before. Next, redefine Jk and H4 as Jk = 6^^ Ylf=i Geo{^kn)Ik{‘^j) 
and Wk = J2f=i Geo{<^j)Geo{<^j)'ik{'^j) where, following the convention, 
we write bk = bnk - Then, by (7.3), (7.7), Lemma 7.5 and (C.4), 

\\Wk-[a{0)fr*\\ = o(l) and 

6fcJfcP4iV(0,2[c7(0)]V). 

Further, retracing the proof of Theorem 5.1 and using Lemmas 7.5-7. 7, one 
can conclude that a.s. (Px); ||/36»oll = (^f- Claim), max{| 7 j|: 1 < 

j < Xlk} = Op(l) [cf. (7.5)] and the representation (7.6) holds with rjk = 
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Op{h^^). Hence, it follows that 


-log7^fc(6»o) 


\Nk Nu ■ 


-i=i i=i 


Nk 

+ 


i=i 


Nk 

= K [bpk] - 2-^lP'^W,Peo] + E 


j=i 


= 2-%4{W,)-^J, + Op{l) A xl- 

This completes the proof of Theorem 5.2(a). 

Next, consider part (b). Note that in this MID case, Nc~^ <C and 
hence, 6^ ~ Also, using the boundedness of ||G'0o(-)ll over and con¬ 
ditions (C.3)(i), (ii), (iv) and the DOT, one gets 

||s„-2A;;''r|| = o(A:;'='), 

where D = J" Gog{u>)Ggg{uyK‘^(j)'^{(jj)du: is nonsingular. Now retracing the 
proofs of Theorems 5.1 and 5.2(a) (with {n^} replaced by the full sequence 
{n}), one can show that —21og7?.n(0o) = bnJon{^On)~^Jon + Op{l), where 
Jon = b~^ Ylf=l Geoy^jn)in{^j) and Won = E^l Geo{tOjn)Geo{<^jn)'il{^jn)- 
Note that by Lemma 7.5 and the fact that 6^ ~ X'^^, we have 

||W„-6-2s,||=o(1), 

which is different from the previous two cases covered by Theorems 5.1 
and 5.2(a) [where ||1T„ — 2“^(6“^S„)|| = o(l)]. In view of (C.4), this implies 

that 6^ j'^(VLon)“Von Xp, proving part (b). □ 


Remark 7.1. From the proof of Theorems 5.1-5.2, it follows that the 
different scalings in the two cases are required by the dominant term in the 
asymptotic variance of the sum Geo(^A;n)-^n(<^A;n) and the automatic 

variance stabilization factor, both of which arise from the inner mechanics 
of the SFDEL. Under PID and under “slow” MID, the leading term is given 
by A^c“^cj(0), which is of a larger order of magnitude than X'^^. When the 
infilling rate is high, that is, Nc~‘^ <C A()'^, the other term involving the 
spectral density of the Z(-)-process dominates (as in the case of regularly 
spaced time series FDEL) and the standard scaling by —2 is appropriate. 


Proof of Theorem 5.3. By Lemma 7.5 and the fact that (T„(0) — 
it(0) = Op{Xn^^‘^) (cf. Lahiri [18]), under the conditions of Theorem 5.2(b), 




an{0o) 
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bn^Ef=l + 0 ,( 1 ) ^ ^ ^ 

bn^Ef=l \\Go,iu^jnW[2K^^{uj^)]+o,{l) 
while under the conditions of Theorems 5.1 and 5.2, 

= 2-‘(l+Op(l)). 

Now combining this with the proofs of Theorems 5.1-5.2, one can com¬ 
plete the proof of Theorem 5.3. □ 
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SUPPLEMENTARY MATERIAL 

Supplement to “A frequency domain empirical likelihood method for ir¬ 
regularly spaced spatial data” (DOI: 10.1214/14-AOS1291SUPP; .pdf). De¬ 
tails of proofs and additional simulation results. 
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